PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Csa12g046890.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 734aa    MW: 81773.1 Da    PI: 6.3889
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
Csa12g046890.1    genome    CSGP      View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  58.1   1.5e-18  106    157  5          56
                     SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   5 ttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     +++t +q++e+e++Fe++++p +++r +L+++lgLt rqVk WFqN+R++ k
  Csa12g046890.1 106 HRHTVHQIQEMEAVFEETSHPGENKRLRLSEELGLTPRQVKCWFQNKRTQTK 157
                     5799********************************************9987 PP

No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
2    START     115.2  9.5e-37  256    487  2          206
                     HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEE CS
           START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv...........dsgealrasgvvdmvlallveellddkeqWdetla....kaet 81 
                     la ++ qel+ + + +ep+W+k +  +n+ ++l ++e +k            ++ ea+ra++vv m++++l + +ld   qW+e +     +a++
  Csa12g046890.1 256 LAVSCCQELIIMCDINEPLWTKKR-LDNESVCLNEEEYKKMflwpptddddrFRREASRAKAVVMMNSMTLAKAFLDAD-QWSELFCpivsSAKV 348
                     788999******************.6666677777766666888889999999**************************.****9999999**** PP

                     EEEECTT.....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.......-TTSEE-EESSEEEEEEEECTCEEE CS
           START  82 levissg.....galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppe......sssvvRaellpSgiliepksnghsk 163
                     ++ iss      g   lm+a lq+ spl p R+ +f+Ry ++  ++ +w+ivd  ++s +   +      +  +   ++ pSg++i++++ng+s+
  Csa12g046890.1 349 IQIISSKvsgtdGSFLLMYAGLQVVSPLLPtREGYFLRYVERkGKKVKWIIVDFPIESFRGFVKpastaaTTTIGLYRRKPSGCIIQEMANGYSE 443
                     ******9999********************************556668*****99987654332223455444444559**************** PP

                     EEEEE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 164 vtwvehvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                     vtwvehv++++++  ++++r  vksg+a+ga +w+a l+rqce+
  Csa12g046890.1 444 VTWVEHVEVEEKQEqDEVVREYVKSGVAFGAERWLAVLKRQCER 487
                     *************999**************************97 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689           1.3E-17   89     160  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   9.2E-18   95     160  IPR009057    Homeodomain-like
PROSITE profile  PS50071            16.147    99     159  IPR001356    Homeobox domain
SMART            SM00389            6.9E-16   100    163  IPR001356    Homeobox domain
CDD              cd00086            2.48E-15  102    160  No hit       No description
Pfam             PF00046            4.9E-16   106    157  IPR001356    Homeobox domain
PROSITE profile  PS50848            38.962    246    490  IPR002913    START domain
SuperFamily      SSF55961           2.75E-23  247    489  No hit       No description
CDD              cd08875            8.58E-92  250    486  No hit       No description
SMART            SM00234            3.5E-20   255    487  IPR002913    START domain
Pfam             PF01852            1.1E-29   256    487  IPR002913    START domain
SuperFamily      SSF55961           2.2E-9    508    694  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 734 aa
METKDKKLMY GGDYQVMKSS EDVSYSDKIF GSTSSSSPTA TIQNPNLKFT AFDNLNFPYM  60
TPKEEEYGVM SMIGSGSGVS TGSGHYLVEN TAIEHESPAT KRSRSHRHTV HQIQEMEAVF  120
EETSHPGENK RLRLSEELGL TPRQVKCWFQ NKRTQTKAQQ DRRANAMLKA ENDTLKTESQ  180
DLQSSLQCLS CSSCGNNLRL ENTRLRQELD RLRRIVSMRN TPPPSQEIAC FFPETDSNNN  240
NDVLIAEEEK AIAMELAVSC CQELIIMCDI NEPLWTKKRL DNESVCLNEE EYKKMFLWPP  300
TDDDDRFRRE ASRAKAVVMM NSMTLAKAFL DADQWSELFC PIVSSAKVIQ IISSKVSGTD  360
GSFLLMYAGL QVVSPLLPTR EGYFLRYVER KGKKVKWIIV DFPIESFRGF VKPASTAATT  420
TIGLYRRKPS GCIIQEMANG YSEVTWVEHV EVEEKQEQDE VVREYVKSGV AFGAERWLAV  480
LKRQCERMAS LMATKITDLG VIPSVAARKN LMKLSQRMVK TFCLNISNSY GQASTKDTVR  540
IVTRKVSDGF VPCAVSVTYL PYSHHKVFDL LRDNQRLSQL EILFNGSSFQ EVAHIANGSH  600
LGNCISLLRI NMESNTWHNV ELMLQETCTD NSGSLLVFST VDADAVQLAM NGKDPSNIPL  660
LPVGFSVVPV NPSDGVEGTS VNLPSCLLTV AMQVLGTSAA DAKLDLSTVS AINNRICATV  720
NRITSALVND VGN*
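
The Start and End columns in the domain tables are 1-based, inclusive positions on the protein sequence above. The following is a minimal Python sketch of how those coordinates map onto the sequence block; the helper names `clean_sequence` and `domain_slice` are hypothetical and not part of PlantTFDB, and only the first three rows of the block are embedded to keep the example short.

```python
import re

# First three rows of the sequence block above, copied verbatim,
# including the spacing and the running position counters.
SEQ_BLOCK = """
METKDKKLMY GGDYQVMKSS EDVSYSDKIF GSTSSSSPTA TIQNPNLKFT AFDNLNFPYM  60
TPKEEEYGVM SMIGSGSGVS TGSGHYLVEN TAIEHESPAT KRSRSHRHTV HQIQEMEAVF  120
EETSHPGENK RLRLSEELGL TPRQVKCWFQ NKRTQTKAQQ DRRANAMLKA ENDTLKTESQ  180
"""


def clean_sequence(block: str) -> str:
    """Keep letters only: drops spaces, position counters, and a trailing '*'."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


seq = clean_sequence(SEQ_BLOCK)
# Homeobox coordinates taken from the Signature Domain table (Start 106, End 157).
homeobox = domain_slice(seq, 106, 157)
print(homeobox)
```

The extracted residues can be compared with the Csa12g046890.1 row of the Homeobox alignment in the Signature Domain section, which spans the same positions.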
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_010449572.1  0.0      PREDICTED: homeobox-leucine zipper protein HDG4-like
Swissprot  Q8L7H4          0.0      HDG4_ARATH; Homeobox-leucine zipper protein HDG4
TrEMBL     R0GP44          0.0      R0GP44_9BRAS; Uncharacterized protein
STRING     AT4G17710.1     0.0      (Arabidopsis thaliana)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM4356              25           48
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G17710.1  0.0      homeodomain GLABROUS 4